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Abstract 

This piper presents the Integrated Dls'< 
Ca'jt;e (ITiC) system developed for ACOS scries 
larw:c-j:cale computers. The IDC features a 
unique file -adaptive control f-^r hii;h 
porformnnce gains and improve.*: -.1 aocps.-^ time 
by 1 factor of ap to 10. The IDC c^n be used to 
improve the throughput and recpons»» li.-re of an 
:/0 bound computer system. 



Introduction 

On«'' of the major performance bottlenecks of 
lary.e-ccale computer systems is said to be slow 
access time to disk data, which is associated 
with tlick mechanical motions. 

To remove this performance bottleneck, 
several programming and operational techniques 
have been used. They include decreasing the 
numb*?r of disk accesses by program 
modifications, reducing seek time by file 
n.^allocation and displacing. disk data to 
fast-access devices. However, these techniques 
are not only costly but also limited • in 
effectiveness and lack in adaptivity to dynamic 
changes: in system environments. 

One effective approach to solvinc; these 
problems is the use of a disk cache or buffered 
disk. This is a technique to decrease apparent 
disk access time by moans of automatically 
storing frequently used data in a high-speed 
buffer memory, which is provided between disks 
and main memory, 

A number of papers discussed the 
effectiveness of a disk cache ^» *• , but very 
little has been reported on the development of a 
disk cache which has been put to practical use. 
This paper describes the Integrated Disk Cache 
(IDC) system which has been developed for NEC's 
ACOS series large-scale computers. Unlike the 
k.nown disk cache 3, the IDC is integrated into 
a central input/output processor and features a 
unique file adaptive control for performance 
enhancement. The following sections describe 
the major design considerations, 5!ystem 
conf ii^urafcion, file adaptive control and 
performance gains for the IDC. 

Major Design Considerations 

In developing a disk cache system, the 



following requ ire rent. s should 
consideration, a.mong other.*?. 



he 



1 ) Per.''ormance 

In nrder to achieve high perfornance, 
frequently accessed data must be found with 
h:?:h s.^obabili ty in the cache. Sln'?e dat-a 
access characteristics widely vary, 
depending on files and appl i::at lona used, a 
disk cache should be adaptive to these 
various conditions. 

2) He I iab i 1 i ty and ava 1 la b i I L ty 

Since di::ks contain critical systen da:.a, 
sufficient care should be taken to retain 
data intet^rlty. This requires nnt only that 
a disk cache be highly reliable, but also 
that it should be able to recover data if -a 
failure occurs. 

3) Tr3n.*;parency 

The function of a disk cache, differing from 
a high speed file device, is to improve 
;^pparent or effective disk access time. Its 
existeni-:e, therefore, should be as 
transparent as possible at a user level, at 
an operating system level and at a hardware 
level. 

It is difficult, however, to meet all of the 
three require-ments to the same degree. From 
performance ami availability viewpoints, strict 
transparency is not desirable, because file 
categories have considerable influence on 
efficient cache operations, as follows: 

First, disk files are roughly grouped into 
random file or sequential file categories. Disk 
cache performance gains for these two file 
categories are based upon completely different 
operations: One involves repeated access to same 
data. The other involves access to prefetched 
data. It is difficult, therefore, to obtain 
high performance for both of these operations 
using a single control. Second, disk files are 
also grouped into permanent file or temporary 
file categories. While a permanent file should 
always be kept in a disk, a temporary file is 
allocated and used only during a task or job 
execution; it is not absolutely necessary for it 
to be stored on a disk. Therefore, separate 
handling of these two kinds of files would allow 
significant performance improvement, yet 
a.'isuring data availability. 
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Thus U was decided to ILmiC the IDC 
transoarency to an extent that ACQS operating 
system sees the existence of a disk cacne and 
can provide it with the information on fUe 
categories. ' The disk cache. based on this 
information. controls the caching operations 
adaptively according to each file category. 
This is called a file adaptive control m this 
paper. 

S ystem Configuration and Ba sic Operations 
System Configuration 

The IDC consists of a high speed buffer 
memory, called a disk cache, and a controller 
integrated into the Input/Output Processor 
(lOP), as shovm in Fig. 1. 

The disk cache includes up to 15 megabytes 
of MOS random access storage which stores 
frequently used data under the lOP control. It 
is connected to the System Control Unit of the 
ACOS system and is shared by all the disks in 
the svstem. The data in the disk cache can be 
transferred to and from the main memory at a 
speed of up to 5.3 megabytes per second, which 
is three to six times faster than that of a 
conventional disk. Needless to say. seek time 
and latency tiioe are not required to start che 
data transfer. . . . 

The lOP is an intelligent controller which 
handles all the input/output transactions issued 
by the central processing unit (CPU). This 
capability is used to control the IDC. For this 
purpose, a control program called Disk Cache 
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Fig, 1 IDC system configuration 



handler is integrated into the main program of 
the lOP. The Disk Cache handler, interfacing 
with ACOS operating system, controls all the IDC 
operations, including directory management for 
locating the accessed data, data transfer 
between the disks and the disk cache, and data 
transfer between the disk cache and the main 
memory. 

Sector Addressing and Data Mapping 

Although the record read or written by the 
operating system is variable in length, data are 
recorded on a disk in fixed blocks, called 
sectors- A number of sectors make up the 
record. The sector has a unique address and all 
the accesses to the disk data are made using 
this address. This addressing allows effective 
data naopirg and adaptive control by the IDC. 
independent of the physical structure of disk 
devices being used. 

Like a CPU cache, the disk data are mapped 
into the disk cache using a set associative 
mapping, as shown in Pig. 2. The basic mapping 
unit is a block which consists of a number of 
sectors. The block size as well as the number 
of sets and the number of levels are adjustable 
to each user environment. 
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Basic operations 

Before explaining the file adaptive control, 
basic cache operations are described here. 

(1) Read Operation ^ ■ H<«ir 

ir the accessed record is found in the disk 
cache, the record is transferred to the main 
memory. This completes the read operation. 
If the record is not found in the disk 
cache, the data block which contains that 
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record is read frora a disk Into the disk 
cache and the record is transferred to the 
main memory. If r.o free cache blcclc is 
available, least recently used (LRU) cache 
block is replaced. 

(2) Write Operation 

The record is always written onto a disk. 
If an old copy of the record is found in the 
disk cache, the record is also transferred 
to the disk cache to update the copy. In 
either case, an I/O completion is sent to 
the operating systen after the record has 
been written onto the disk. The disk access 
time, therefore, is not improved by the disk 
cache. However, since this operation 
guarantees that the data in the disk are 
always what the operating system assumes, no 
change is required in the error handling 
processing. 

File Adaptive Control 

Four Operation Modes 

Four operation modes are provided in the IDC 
to realize high performance while assuring 
sufficient availability (Table 1). These are; 
(1) Basic mode, (2) Sequential File mode, (3) 
Temporary File mode and (U) High Speed File 
mode. Modes (1), (2) and (3) can be specified 
Independently at each I/O request. Mode (^) can 
be specified on a file basis. 

(1) Basic mode 

This is the mode intended for a random file 
processing and performs the basic operations 
described in the previous section. 

(2) Sequential File mode 

The following two operations are added to 
the basic operations to utilize the nature 
of sequential file processing. 

(a) When the record to be read is not found 
in the cache, disk blocks which contain n 
records are prefetched Into the cache for 
subsequent use. This guarantees an average 



hit ratio fa probability that a record is 
found in the cache) of 'In-w/n for any 
sequential file processing. For example, 
assuming a record longer t*:an a cache block, 
the cache hit ratio goos up from 0 J to 67 J 
by applying this control with n=3- Another 
merit of this control is that it illows the 
cache block size to be decreased or tuned to 
fit other file categories, irrespective of 
the sequential file performance. 

(b) A cache block, whose last record has 
been accessed, is placed at the head of the 
LRU list for immediate use by other filer? . 
This is based on the fact that the same data 
is rarely accessed again soon "in a 
sequential file processing. Thir> control 
realizes efficient usage of cache space and 
an increased overall hit ratio. Without 
this control, one sequential file could 
consume all the cache blocks. However, with 
this cont.'*ol, the sequential file is 
confined to a few blocks within one level. 

(3) Temporary rile mode 

A temporary file processing usually starts 
with writing a new record, followed by 
eventual reading of the record. Therefore, 
this mode is designed to operate in such a 
way that a record is always written into the 
disk cache, assigning a new block if 
necessary. This control increases the hit 
ratio in read operations. For example, 
assuming a work file processing involving 
writi.ng a small amount of data and reading 
it at once, read hit ratio goes up from 0 % 
to 100 % by adding this control to the basic 
mode . 

(U) High Speed File mode 

Whereas all the modes described - above 
improve only read access time, this mode is 
designed to decrease both read and write . 
access times. One way to achieve this 
objective is to notify an I/O completion to 
the operating system, immediately after a 
record has been written into the disk 
cache. The record is written onto a disk 



Table 1 Summary IDC operations 
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thereafter. This method, however, haa the 
following problem. When a x-aiiure occurs 
and the record cannot be successfully 
written onto a disk, there is no way to 
notify this event to the operating system 
which has already finished the I/O 
processing. assuming its successful 
completion. Allowing such a dual 

notification requires significant 

modifications in the error handling and 
recovery routines for the operating system. 
To avoid this problem, it was decided not to 
write a record onto a disk but instead to 
restrict the use of this mode to a temporary 
file only. 

The High Speed File mode operates as if a 
portion of file space is allocated in the 
disk cache. A file to be handled in this 
mode must be registered in the IDC. In 
other words, the operating system first 
sends to the IDC its size and start address 
on disk, and sends a release command when 
the file processing finishes. When disk 
records in the registered file area are 
accessed, the records are allocated m the 
disk cache block by block. Once allocated, 
these blocks remain in the disk cache until 
a release command is received. When no 
cache space is available, the records are 
handled as a Temporary File mode and are 
written onto a disk. To prevent this mode 
from exploiting the cache, an upper capacity 
limit is provided for this mode. 

Control Interface 

Figure 3 shows the control interfar- 
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Fig. 3 IDC control interface 



structure for the file adaptive control. For 
the interface between the operating system and 
the IDC, a new control field is provided in a 
channel program control word. In this field, 
the Input Output Supervisor specifies, for each 
disk I/O operation, IDC on/off or an operation 
mode to be used- The IDC, based on this control 
field, dynamically changes its operation mode 
and performs the appropriate operations. In the 
High Speed File mode, a newly provided file 
registration/release command is also used. 

For a user interface, several system 
defaults are provided • for easy IDC control. 
These include automatic IDC application to all 
sequential files, TSS swap files, system library 
file etc. An end user, if necessary, can also 
control the IDC use on a file basis. For this 
purpose, an IDC parameter was added to a job 
control card. 

Error Recovery 

The disk cache is designed to have 
sufficiently high reliability through its error 
correcting capability. If an uncorrectable 
error occurs, however, the associated block is 
removed from the system and the IDC continues 
its ooeration in a degraded mode. When data m 
the disk cache becomes inaccessible, due to some 
hardware failure, the I/O request involved in 
the failure is converted to a usual disk I/O 
operation. Except for the High Speed File mode, 
this conversion is always possible because the 
IDC is controlled in such a way that the data m 
the disk cache always has its copy on a disk. 
In the High Speed File mode, the I/O request 
involved in the failure ends In an I/O device 
error. 

In summary, when an IDC failure occurs, the 
IDC is simply bypassed in either mode other than 
the High Soeed File mode. In the latter mode, 
only a job involved in the failure is aborted. 

Performance Gains 
Access Time laprovement 

Disk record read time (t). when using the 
IDC, is given by 

t = p.r + (1-p) (b + r) 
= r ^ (l-p)b 

where 

p : disk cache hit ratio 
r : record read time from the disk cache 
b : block(s) load time from a disk into 
the disk cache 
Based on this formula, the average effective 
disk access time is calculated and compared with 
the conventional disk access time in Fig. ^, 

As can be seen from this figure, an 
iraorovement factor of more than 10 is obtained 
by the IDC for reading the data in the disk 
cache or accessing the data in the High Speed 
File mode. In other cases, however, the data 
access time depends on the cache hit ratio and 
the percentage of read access occurrences. ir 
80 - QO S hit ratio and 6 : U read/write ratio 
is assumed, as is often found to be the case. 
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Sa.TipIe Measurement 

The improver.er.t in throughput or response 
time, which is obtained by applying the IDC, 
depends on the application, system operation., 
parameters and the IDC parameters among other 
factors. Therefore, the performance gains 
obtn Lned by the IDC should be discussed 
.separately for each user system. This Is beyond 
the scope of this paper. Here, only sample IDC 
performance gains are shown in Table 2, which 
are based upon the measurenient of user model 
jobs . 



Appiicafion 


Improvement 


Hit ratio 


CompilQiion 


I0-207o(elopsed time) 


85-90% 


COBOL 


10-35 { " ■ ) 
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10-30 ( - } 


60-90 


TSS 
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3330 like disk Disk cache :0.5-6Mbytes 
Note:High Speed File mode is not used. 

Tabic* .? S-iripl^ IOC porf'vrmanc*? mt*as:jr«.r.en t 
Cone 1ms ior. 

The Interirited Disk Ca-.^-;^ .v/stem, whlr^h 
inclMriet' up to l6 megabytes of MGS ir.-.Tiory, ^.a.s 
been Jeve l-po-ji . From ;.orformano** ar:l 

re li -ability vlewpoiats, th»? IDC employs rilr' 
adaptive control, in which four operation i!0'!p:'. 
a-e* provided -ir.d tr.eir usr ',3 specified by the 
operating f^yst^m. The IDC i-nproves the av^rag^ 
disk access ti.r.e by a factor of two to ten and 
user job performance by 10 t'j 2*5 %. Fil-^ 
adaptive control, lii<e the one presented he-e, 
is thought to be an effective metho<l to "'--alir:!* 
a hi?;h performance disk cache system. 
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